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ABSTRACT 



c^ ■ We show that the use of the fourth-root trick in lattice QCD with 



staggered fermions corresponds to a non-local theory at non-zero lattice 
53^1 spacing, but argue that the non-local behavior is likely to go away in 

"^ I the continuum limit. We give examples of this non-local behavior in the 

free theory, and for the case of a fixed topologically non-trivial background 

^ ■ gauge field. In both special cases, the non-local behavior indeed disappears 

in the continuum limit. Our results invalidate a recent claim that at non- 
zero lattice spacing an additive mass renormalization is needed because 
of taste-symmetry breaking. 



*Perinanent address: Department of Physics and Astronomy, San Francisco State University, San 
Francisco, CA 94132, USA 



1. Introduction 

Staggered fermions P have long been in use as a method for formulating the 
quark sector of lattice QCD. The main advantages are that they are relatively in- 
expensive when it comes to including sea-quark effects in lattice computations, and 
that they have an exact chiral symmetry in the limit of vanishing bare quark mass. 
The combination of these two advantages makes it possible to reach rather low quark 
masses, which are essential for any serious phenomenological applications of lattice 
QCD. 

These benefits come at a price, however. A theory with one flavor of staggered 
fermion on the lattice yields a theory with four quarks in the continuum limit. This 
is a consequence of fermion species doubling, which is unavoidable in any situation 
in which an exact chiral symmetry is preserved on the lattice. In modern language, 
these four quarks per flavor of lattice staggered fermion are referred to as "tastes." 
Only in the continuum limit does the theory recover a full SU{4) taste symmetry, 
whereas at any non-zero value of the lattice spacing this group is broken to a smaller 
discrete subgroup 0. 

In principle, the four tastes can be given different masses , but this is not what 
is done in practice.^ Instead, each staggered flavor (up, down, or strange) is given a 
single mass, leading to four tastes of degenerate quarks per flavor. In order to obtain 
a theory with only one quark per flavor appearing in sea-quark loops, one reduces 
the number of tastes by taking the fourth root^ of the degenerate-mass staggered 
determinant for each flavor j^]. 

This formulation of the sea-quark sector of QCD does not necessarily correspond 
to a local field theory at non-zero lattice spacing a. The potential lack of locality 
has been the cause for much concern recently j4j about the application of staggered 
fermions to high-precision hadron phenomenology. At issue is: (1) whether the theory 
is local at a 7^ 0, and (2) whether the theory, if non-local at a 7^ 0, becomes local 
in the continuum limit. An alternative way to phrase the second question is to ask 
whether the theory is in the correct universality class. 

In this paper, we will argue (Sec. 2) that the theory with the fourth root of the 
staggered determinant is indeed non-local at non-zero a, but that this does not imply 
that the answer to the second question is negative. We connect the issue of locality to 
the role of taste and chiral symmetries. In Sec. 3, we give some simple examples that 
show how the correct local continuum theory may indeed be obtained. In addition, 
we demonstrate that recent claims about the properties of staggered fermions at non- 
zero a, in particular about the renormalization of the bare mass j3], are incorrect. 
A concluding section summarizes our arguments and results; while the Appendix 
collects some useful properties of various Dirac operators in the taste basis. 



^One reason is that breaking the taste degeneracy requires additional hopping terms in the lattice 
action, which, for a generic choice, make the fermion determinant complex. Also, the existence of a 
partially-conserved continuous chiral symmetry depends on the choice of mass term. 

^In the isospin limit, the up-down sector is represented by a square root of a staggered determi- 
nant with the common light quark mass. 



2. General considerations 

We begin by giving our general argument. Suppose that the theory with the 
fourth root did correspond to a local field theory on the lattice at non-zero a. By 
definition, this would require that the two theories differ only by a local functional. 
In other words, 

Y)ee'\D,tag) = Det(D) exp(-i 5S,ff) , (2.1) 

where Dgtag is the staggered Dirac operator, D is a local lattice Dirac operator that 
describes one quark field in the continuum limit, and 5Seff is a local effective action 
for the gauge field. ^ Saying that 5Seff is local means that it produces only effects at 
the scale of the cutoff. This would imply that, apart from a renormalization of the 
gauge coupling constant, the presence of SS^jf would not affect the behavior at any 
physical length scale that is to be held constant as the lattice spacing is taken to zero. 
It is rather easy to see that this set of assumptions leads to a conflict with what 
we know to be true about the original staggered theory, i.e. the one without the 
fourth root of the determinant. Taking the fourth power of Eq. ()2.1|) . we have 

Det(D,t„,) = mi\D) exp(-<55e//) . (2.2) 

Under our assumption SS^ff is local, and it therefore cannot change the long-distance 
behavior of any correlation function. In particular, it cannot have any effect on the 
Goldstone-boson (GB) masses predicted by the staggered theory deflned by D^tag, 
and those predicted by the theory deflned by 

D4t = D®l , (2.3) 

where the second factor is a unit 4x4 matrix, to be interpreted as the identity matrix 
in taste space. The operator D describes a lattice theory with one taste; in a flnite 
volume, the size of the matrix D is in fact four times smaller than the size of Dgtag- 
Clearly, we have that Det^D^t) = Det^{D), and the lattice theory deflned by D^t has 
a continuous 5'f/(4) taste symmetry. 

We can now compare what we know about the GB spectrum of the two theories. 
In the theory deflned by D^t, there will be flfteen GBs, transforming in the adjoint 
representation of S'f/(4), with possibly a common non- vanishing mass if the operator 
D violates chiral symmetry and/or is not massless . Under our assumption described 
above, SSejf does not change this fact: all long-distance physics would be contained 
in D^t. 

The GB spectrum of the staggered theory is completely different, irrespective 
of the value of the staggered bare quark mass. Of course, in the continuum limit, 
one recovers flfteen degenerate (pseudo-)GBs, but at non- zero lattice spacing, they 
split up into at least four [71 |H], and up to seven [Hj, non-degenerate irreducible 
representations, consistent with the lattice symmetry group of the staggered theory. 
Indeed, at strong couphng OIII], there is only one exact GB (at zero quark mass), 
because of the exact t^(l)e axial symmetry |1UJ . 

■^ Adams has recently emphasized that Eq. 1)2. 1() with SSeff local is indeed the proper definition 
of locality of the rooted theory at a 7^ 0; requiring SSeff = would be too strong. 



It thus becomes clear that our assumption on SSejf cannot be correct. The 
effective action SSefj has to know about the long-distance effects of taste-symmetry 
breaking, and cannot be a local functional of the lattice gauge field. Of course, given 
a local operator D, one can always define SSeff through Eq. ()2.2p or Eq. ()2.ip (as 
long as we consider gauge fields on which D has no exact zero modes, cf. Sec. 3b), 
but what we find is that SS^ff cannot be local. This shows that the theory defined 
by taking the fourth root of the staggered determinant must be non-local at a 7^ 0. 

It also follows that the staggered theory without the fourth root cannot be written 
as an S't/(4)-symmetric local theory at a 7^ 0. In Ref. [5], it was assumed that Eq. ()2.2j) 
held with SSeff local. ^ However, we have shown that such a decomposition is not 
possible. 

What might be confusing is that the left-hand side of Eq. ()2.2j) is the determinant 
of a local operator, Dgtag- Clearly, the determinant, or equivalently the effective action 
Seff = — Tr log{Dstag), is a non-local object. What we observe is simply the fact that 
the non-locahty of Seff cannot be reproduced entirely by the effective action for 
the operator D^t, because of a conflict between the symmetries of Dgtag and D^t at 
non-zero lattice spacing. It is true that Dstag itself can be written as the sum of 
taste-invariant and taste-breaking local operators: 

D,tag = D^l + Y,DA^EA, (2.4) 

A 

with the Sa a set of fifteen 5'f/(4)-algebra valued (hermitian) generators in taste 
space, ^ with D and Da all local. ^ Considering the determinant, however, one has 
that 

Seff = -\ogDet{D,tag) (2.5) 

= -41ogDet(Z^)-logDet [i + 5]D-1Da®Sa) . 

This split of the effective action corresponds to choosing a specific D in Eq. (j2.2j) . Due 
to the presence of D~^, the second term produces a non-local SSeff, even though the 
taste-breaking part of the Dirac operator in Eq. ()2.4j) is local. What we have argued 
above, on the basis of the GB spectrum of the staggered theory without fourth root, 
is that no split of the form of Eq. ()2.2|) exists for which SSeff is local. While it is 
generally accepted that the taste breaking effects of the operator J2a ^a ® '^a vanish 
in the continuum limit, it is precisely the non-locality of SSeff that causes the fifteen 
GBs of the staggered theory to be non-degenerate at a 7^ 0. 

While our argument demonstrates that no local lattice theory exists with a 
fermion determinant equal to the fourth root of the staggered determinant, it leaves 
open the question of whether the non-local behavior persists in the continuum limit. 
Nevertheless, Eq. (j2.5|) lends support to the conjecture that the non-localities vanish 
in this limit. Although SSeff is non-local, the operator J2a^a ® H^ is of order a. 

'^exp{— SSeff) was written as Det(T) in Ref. [S]- 

^We may choose this set to be {£,^J.,^^^i^v,'ii^ii5,^5} with ^^ a set of 4 x 4 matrices satisfying 
{^^Lj^u} = "26^^. 

^Lattice symmetries, such as U{l)e symmetry, further restrict which ^^ can appear, as weU as 
the form the Z?^ can take. 



Thus the effects of SS^ff should vanish when the limit a — > is taken while keeping 
physical momenta fixed/ 



3. Examples 

To make the discussion more concrete, we now give a possible prescription for 
the construction of the operator D^ in Eq. ()2.4|1 . We begin with a massive staggered 
Dirac operator Dgtagi^ = -Dsjag(0)+r72 with bare quark mass m in the one-component 
formalism.^ There exists a gauge-covariant unitary transformation Q^^^ which puts 
the theory into the taste representation of Refs. P^ll3j .^ We may however carry out 
this transformation as a gaussian renormalization-group (RG) blocking, leading to a 
staggered Dirac operator in the taste representation Dtastei^ given by [TH] 

A'aLeM = - + Q'~'^D-l^im)Q^''>^ , (3.1) 

where a is a parameter which appears in the gaussian blocking kernel. We then have 
that 

BetiD^tagM) = Det(G-i) Det(AasteM) , (3.2) 

with 

G-' = -Dstagim) + g(0)tg{0) ^ _/} (^) + 1 , (3.3) 

a a 

where in the last step we have used the fact that the kernel Q^^^ is unitary for this 
"RG blocking." For a — > oo, one recovers a transformation of the type considered in 
Ref. |13j, but we will take a to be finite here. Because G~^ is a Dirac operator with 
a mass of order a in lattice units, all the long-distance physics should be contained 
in Dtaste{m). 

Again following Ref. J3] , one may use Dtaste as the input for n true RG blocking 
steps (in which actual thinning out of fermionic degrees of freedom occurs) with an 
RG blocking kernel Q^*^-*. The "n}^ blocking step takes us from a lattice with spacing 
ttn-i to a lattice with spacing a^ = 2a„_i; oq is defined to be the spacing of the lattice 
associated with Dtastefi = Dtaste and is twice the spacing of the original lattice on 
which Dstag is defined. Blocked operators Dtaste,n and G^^ result from this process, 
with, recursively, 

DtalteM) = - + Q^"^A"akn-l("^)Q^"^^ (3-4) 

a 

G-' = -Aa.te,n-l(m)+g('^)tg(n) 

a 
g(n)g(n)t _ ^i ^ 



where c is a positive constant, and here "1" stands for the Kronecker delta on the 
coarse lattice. One expects that the long-distance physics is entirely carried by 
Dtasteni which is manifestly the sum of a smeared quark propagator and a contact 



^We expect that the continuum hmit will have to be taken before the theory is continued to 
Minkowski space. 

*We will make the dependence on the quark mass explicit for the rest of this paper. 
^The transformation Q^^^ is not unique; see Ref. m] for details. 



term, while Tr log{G^^) is a local functional of the gauge field. The determinants are 
related by 

n 

BetiDstagim)) = Det(Aaste,n(m)) n Det(G'fc ^) , (3.5) 

fc=0 

with Gq^ = G'^ from Eq. (jSISD- While Eq. (jSSl) resembles Eq. ((23), it is fundamen- 
tally different. In Eq. (j3.5|) . both Det^Dgtagi^) and Det(Djaste,n("^)) describe the 
same long-distance physics, and the factor 11^=0 Det(G^^) is expected to be a local 
functional of the gauge field. For any finite n, both D^tagl"^) and Dtaste,n{^ break 
taste symmetry, consistent with our general arguments above. 

The massless one-component action is invariant under f/(l)e transformations |l()j . 

Sx{x) = ie{x)xix) , 5xix) = ie{x)xix) , (3.6) 

because e{x) = (— i)^i+^2+a::3+a;4 anti-commutes with Dgtagi^)- From 

g(°) e = (75 ® ^5) g^°) , (3.7) 

it follows [T3] that Dtaste = Dtastefi satisfics a Ginsparg-Wilson (GW) relation ^H! 

{75 ® ^5, A"aL(0)} = - (75 ® ^5) , (3.8) 

a 

if the original operator Dstag is massless. ^° Using Eqs. ()3.H) and ()3.7j) one can show 
that (75 (S> ^5) -Dtaste (0) is hermitian. Equation ()3.8|1 then implies that the eigenvalues 
of -Diasje(O) lie on a circle in the complex plane crossing the real axis at and a, with 
center at a/2. 

If we start with a massive staggered Dirac operator Dgtagifn) in the one-component 
formalism, we obtain a corresponding massive operator Dtaste{f^) in the taste repre- 
sentation. Using the fact that Dtasteijn) = -Dtaste(O) + m for a = cxd, it is straightfor- 
ward to show for finite a that 



DtasteiO)+m[l-lDtasteK ,j 
Dtaste{m) = 7 — ^- — T — '- . (3.9) 

1 + f (l-iAa.te(0)) 

This operator is local, because the second term in the denominator is small compared 
to the 1 (as long as m -C 1 in lattice units). The eigenvalues still lie on a circle, now 
with center {a /2 + m) / {1 + m/ a) and radius {a/2)/{l + 'm/a). In particular, the two 
possible real eigenvalues are m/(l + ^) and a. 

In general, Dtaste,niO) satisfies a GW relation for any n, since the RG kernels Q^"'^ 
for n = 1, . . . are trivial with respect to Dirac and taste indices. Explicitly, we have 
that |THj 

{75®e5,A"aL,n(0)} = — (75 ® ^5) , (3.10) 

an 
1-c 

1 - c"+i 



10 



Note that this reduces to an ordinary chiral symmetry for a — > cxd. 




Dtaste,n IS not invariant under the full taste SU{A) for any finite n. We may 
construct an SU{A) taste-invariant operator by simply taking the trace in taste space: 

where tr denotes a trace over taste only. This operator is not necessarily massless if 
we set m = 0, but whatever quark mass the theory defined by Z}j„j,.„(m) has, it is 
proportional to the unit matrix in taste space. It is also clear that -Dmt,,n(0) does not 
satisfy a GW relation. 

However, it is straightforward to construct an operator that does obey a GW 
relation. In order to do this, we note that Dinv,n has no fermion species doublers 
for finite a. (We will show this explicitly in Sec. 3a.) Furthermore, the fact that e 
anti-commutes with Dgtagi^), combined with anti-hermiticity of -Dstag(O), implies that 

D;tlgMy = ^ D;,l,ira) e . (3.12) 

Using Eqs. ()3.ip . ()3.4|) . ()3.7p and ()3.11|) . it is then easy to see that 75-Dj„j,_„(m) is 
hermitian. We may thus construct a taste-invariant overlap operator, just as when 
one starts with a Wilson-Dirac operator jl7j : 

-75 sign (^75(^1 Dinv,n{0))] J , (3.13) 

with On given in Eq. ()3.10p . Since this operator is taste invariant, it satisfies a GW 
relation for any taste matrix S: 

{75®S,D,-;j = — (75®S). (3.14) 

It follows that Dov,n is a massless operator. ^^ 

The operator Dov,n can be written as Z^ (g) 1 as in Eq. ()2.3|) . and the resulting 
Z^ is a possible choice for use in Eqs. ()2.H) and ()2.2|) . Obviously, we can only have 
that Dov,n -^ Dtaste,n{jn) for w — > cxD if we take the original one-component staggered 
operator to be massless, so that Eq. ()3.in|l coincides with Eq. (I3.14|l for S = ^5. 
This overlap operator is "natural," because it has been constructed such that the 
difference between Dov,n and Dtaste,n{S^) is expected to be of order a1/a^ = 1/2^" 
113]. The distinction is that, by construction, Dov,n has exact SU{4:) taste symmetry 
(in fact a full chiral SU{4:)l x SU{4:)r), while -Dtaste,n(0) does not. The expectation 
that the difference decreases like 1/2^" arises from the similar expectation that taste 
symmetry is restored in the unrooted staggered theory as we take n — > cx), i.e. as the 
lattice spacing of the original (unblocked) theory is sent to zero. 

The sequence of overlap operators can be made massive by choosing 

Doy,n{m) = Doy,n{0) + Dinv,nim) - A™,n(0) , 171^0, (3.15) 

with m the original bare staggered mass, and Dov,n{0) = Dov,n of Eq. ()3.13p . Our 
choice is different from the massive overlap operator commonly used in the literature. 

^^This is true even if the original operator Dgtagi'm) is not massless, i.e. if -D,;m).n(0) is replaced 
by Dinv,n{m) on the right-hand side of Eq. H3.13|l . 



The reason is that, this way, we maintain the above naturalness property for m ^ 
as well. For details, see Appendix Al. Unless the n ^ oo limit is taken, the two 
theories defined by D taste, nijn) and Dov,nijn) "will not have the same renormalized 
mass; but since the mass in both theories renormalizes multiplicatively, both theories 
are massless for m = 0. This follows from the fact that both -Diaste,n(0) and -Doi,,n(0) 
have a Ginsparg-Wilson-Liischer (GWL) chiral symmetry ^H]- Any of the operators 
Dov,n{^ is a possible choice for D4t in Eq. (|2.3|) . 

3a. The free case 

The free case provides an explicit example of Eq. (j2.'2j) . with SS^jf non-local. 
We choose n = and use Dinvfl{rn) for D^t = D ® 1 on the right-hand side of this 
equation. In the free case, a Q^^^ exists such that [121 HSj 



g(°)D,t,,(m)QWt =^ ^,(^^^l)sinp^ + (^5®^^^5)(l_cosp^)j + (l®l)m (3.16) 

in momentum space. Using this Q^^-* in Eqs. ()3.1|1 and ()3.11|1 . we obtain 

Em (^(7/. ® 1)Pm + 1(75 ® ^^6)^^) + (1 ® 1) (m + i(p2 + m- 

Dtastefiim) = l + ^ + J_(p2+^2) 



Em i(7M ® i)vHl^lH!I^ « (^' + "'^ 

1 + — + ^(p2 + m 



Dinvfiim) = , , 2m , 1 .^2 , -..9N ^ ' (3-17) 



where 



Pf, = smp^ , 

p^ = 2sin(p^/2) , (3.18) 

-2 v^ -2 

p = !LPp 



> 

M 



We see that Di„yQ{m) is a Wilson- like Dirac operator, and thus has no fermion dou- 
bling as long as a is finite. The massless overlap operator of Eq. ()3.13|) in the free 
case is 



DovfiiO) = I 



' l-fA™,o(0)^ 



2y^ p4 



= A™,o(0)+0(p^) . (3.19) 

V V^~ -wwf J 

The argument of the square root is strictly positive as long as a < 2. 

We may now calculate SSeff for the free case from Eqs. ()2.2|) and ()2.5|) . choosing 
D (g) 1 = Dinvflim) and using Eq. (jTTTj) . We find 



/ 



e 



-<5S, 



e// 



1 V^ -, 

iEmP, 



4 



[] 1 + ^^^^^ -2 . (3.20) 



, p^ + (rri + ^(p^ + ^ 



Defining 5Ceff by ^Se// = -Tr (SCeff), we have 5Ceff{p) ~ {J2fiPf,)/{p'^ + m'^) at 
small p (and am <^ 1). This implies that the Fourier transform 5Ceff{x — y) decays 



like inverse powers of the separation x~y (or its components) times a factor e""*'^"^' . 
Because m is a physical scale, SS^jf is non-local, Choosing D = Dovfl{fn) instead 
in Eq. (|2.2j) gives a similar result. While this only demonstrates the non- locality of 
5Seff in the free case, it is clear that in the interacting case the non-locality would 
be gauge- field dependent (see the discussion around Eq. ()2.5|) ). 

If we choose to consider the case of Dtaste,nijn) for ra > 0, the exact expressions 
become more cumbersome. However, using the free-theory results of ^Hl, it is possible 
to show that 



Dtaste,n{m) = ^ 2(7^ ® l)p^ + — ^(TS ® ^^6)^^ 




(3.21) 



for small p, leading to 



SSeff = -8 V ^^ fi^^ + . . . (3.22) 



for small p. This shows explicitly how ^Sg// -^ for ri — * cxd, but also how ^Sg// is 
non-local for any fixed n. 

The free case is rather special in that there are no pions, so the argument of 
Sec. 2 does not apply. This allows for the possibility that there may be other choices 
for the operator D in Eq. ()2.2j) for which (JSg// is local. Indeed, Adams jB] has 
constructed such an operator, which has range Ja/m and 5Seff = 0. However, the 
general features of the GB spectrum show that a similar construction is not possible 
in the interacting case. 

3h. Background with non-zero topological charge 

Another example is provided by the staggered Dirac operator in the background 
of a smooth gauge field with fixed topological charge Q = 1. Here, we take the 
operator D^t = D ® 1 oi Eq. ()2.3p to be an overlap operator, and use it (setting 
m = 0) to define the topological charge of the gauge field under consideration. As 
already mentioned, a possible choice would be one of the Dov,n of Eq. (|3.13|) . but in 
principle any overlap operator will do. 

Our choice of gauge field implies that the operator D will have one exact zero 
mode for quark mass m = 0, and thus one eigenvalue proportional to m when m 7^ 0. 
It follows that Det {D) on the right-hand side of Eq. ()2.2|1 will be proportional to 
m^ and vanish as m — *> 0. The operator Dgtag on the left-hand side of Eq. ()2.2j) will 
not have any exact zero modes for a generic gauge-field configuration (in any topo- 
logical sector) at non-zero lattice spacing. Instead, it will have four non-degenerate 
corresponding eigenvalues 

Aj = m + CiO? , z = 1, . . . , 4 , (3.23) 

with 7 a positive exponent. ^^ In general none of the Cj will be exactly zero: If we 
consider for instance an instanton with radius p, the q will be proportional to p~'^~^. 

i2por generic eigenvalues, one expects that 7 = 1. It could be that 7 = 2 for zero modes. The 
precise value of 7 does not affect our argument. 



It follows that 

n(l + ^)- (3-24) 



—15.9 tr TT / 1 , ^« 



da 



m 



This is just the zero-mode contribution to 



6Seff = -Tr\og(l + J2D-'D 

\ A 



A^ ='A 



(3.25) 



in Eq. ()2.5p . where now D has been chosen to be an overlap operator. ^'^ Thus, the 
1/m signals the dependence of SSejf on the non-local D~^. Note also that SSefj 
diverges in the chiral limit for any non-zero lattice spacing, exhibiting the well-known 
fact that the chiral and continuum limits do not commute III 



3c. Consequences for Reference J^ 

Our results invalidate the basic assumption made in Ref. [Sj, which was that 
6Seff defined by Eq. ()2.2|1 cannot affect long-distance physics even at non-zero a. 
Instead, we find that SSeff has to contain long-distance physics at a 7^ because of 
the mismatched symmetries of D^tag and D4t of Eqs. ()2.2|) and ()2.3p . Contrary to 
what was suggested in Ref. 0, it is not possible to reconcile the theories described 
by Dgtag and D4t by an additive shift in the quark mass. Unlike the theory defined 
by Dstag, the theory defined by D^t has to contain fifteen (pseudo) Goldstone bosons, 
which remain degenerate even if they pick up a mass due to the presence of an 
explicit (S'f/(4)-symmetric) quark mass. Based on a comparison of zero modes of 
Dstag and D4t, Ref. 13] furthermore argues that the quark masses of the two theories 
have to be related by an 0(0^) additive quark mass renormalization. That argument 
fails, however, precisely because the relation between the two theories is non-local. 
Our construction of the overlap operators Do^nijn) demonstrates that in fact SUi^i)- 
symmetric lattice Dirac operators exists which becomes exactly massless when the 
staggered quark mass m is set equal to zero. We emphasize however that any overlap 
operator can be used to invalidate the claim of Ref. [S], as discussed in Sec. 3b. A 
correct description of the approach of the continuum limit as far as the physics of 
GBs is concerned is provided by staggered chiral perturbation theory |7|i2Uj. 



4. Conclusion 

Our main result is a proof in Sec. 2 that the theory defined by the fourth root of 
the staggered fermion determinant does not correspond to a local theory at non-zero 
lattice spacing a. This follows from the fact that SU{A) taste symmetry is broken at 
non-zero a in the unrooted staggered theory. If a local theory corresponding to the 
fourth-root theory existed, one could take four copies of it and construct a local theory 
with exact SU{4) taste symmetry, c/. the theory defined by D^t in Eq. (j2.3|) . The 
S'[/(4) symmetry implies that the fifteen pseudo-Goldstone bosons of this theory must 
be degenerate. On the other hand, it is well known that the 15 pseudo-Goldstone 



13 



In this case the sum over A mcludes a term with :2.a = 1- 



bosons in the staggered theory at non-zero a are non-degenerate because of taste 
violations. There is thus a mismatch in the long-distance physics of the staggered 
and S'f/(4) theories when a 7^ 0. The contradiction implies that the rooted theory 
cannot be local at non-zero a: SSejf, defined through Eq. ()2.2j) . must be non-local. 

The key issue is then whether the non- locality persists in the continuum limit. 
While this remains an open question, the argument given around Eq. ()2.5|) suggests 
that the theory is in the desired universality class as long at the continuum limit is 
taken before the chiral limit. In other words, it appears that locality will be restored 
for a — >• at any m 7^ (c/. Sec. 3b). For recent theoretical results supporting this 
conjecture, we refer to Refs. JT11I2I|. 

While the main argument summarized above stands alone, we have discussed two 
examples that make our reasoning more concrete. The examples are provided by the 
staggered theory in the free case (Sec. 3a) and in the background of a smooth gauge 
field with non-zero topological charge (Sec. 3b). Starting from the staggered Dirac 
operator, we constructed a sequence of overlap operators Dov,n in Sec. 3, which can 
be used to give a fermionic definition of topological charge suited to our arguments. 
In both examples, we find that SSeff is explicitly non-local, but that the non-local 
behavior disappears in the continuum limit. 

Note Added 

Recently, Hasenfratz and Hoffmann [22] have posted a paper that discusses stag- 
gered fermions in the context of the Schwinger model. They present numerical evi- 
dence that the staggered determinant (on both unrooted and rooted ensembles) can 
be made approximately equal to an overlap determinant by adjusting the overlap 
mass appropriately, up to a local effective action. When the quark mass is large com- 
pared to the taste violations, it is not inconsistent with the arguments given here that 
the physics of the overlap and staggered fermions could be approximately the same. 
However, at low quark mass the properties of the GBs guarantee that the physics of 
the two theories must be drastically different; indeed, numerically the matching of 
determinants deteriorates. In QCD, current simulations are in this "low mass" 
region (m ~ a^A^^^) , where staggered chiral perturbation theory (71 1201 121] is the 
appropriate tool. 

Acknowledgments 

We thank Anna Hasenfratz and Tom DeGrand for discussions, and are grateful to 
the Institute for Nuclear Theory at the University of Washington for its hospitality. 
MG also thanks the Physics Department of the University of Rome "La Sapienza" 
for hospitality. MG was supported in part by the Generalitat de Catalunya under the 
program PIVl-2005; both CB and MG were supported in part by the US Department 
of Energy. YS was supported by the Israel Science Foundation under grant no. 173/05. 



Appendix A. Selected properties of taste-basis Dirac operators 



In this appendix we collect a number of useful results pertaining to the three 
families of taste-basis Dirac operators considered in the text: Dtaste,nijn)^ Din^nijn)^ 
and Dov,n{m). 

Al. Construction of Dgy^ni'm) 

Consider a massless overlap operator Dov that satisfies the GW relation 

2 

{75, Dov} = -DovJbDov ■ (A.l) 

a 

Here a = 0(1/ a), where a is the lattice spacing. The choice of a massive overlap 
operator most common in the literature is 

Dov{m) = {l-m/a)Dov + m, (A.2) 

where Dov{0) = Dov is a solution of Eq. ()A.1|) . In fact, as we will explore, there is a 
large freedom in extending the definition of an overlap operator to the massive case. 
Let us spell out the requirements that a massive overlap operator should meet. 
First, the definition (IA.2I) satisfies 



Dovirn) = DoviO) + Zm + 0{m^a, mpa) . (A. 3) 

This is an obvious requirement for any sensible Dov{m). The 0{m^a,mpa) irrelevant 
terms cannot re- introduce any fermion doublers because ma <C 1. Since m is a bare 
mass, we have allowed for an 0(1) multiplicative renormalization factor Z. In the 
case of Eq. ()A.2|) one has Z = 1, but, anticipating less explicit definitions, there is 
nothing wrong in principle with having Z ^ \. Either way, the value of m must be 
adjusted to reproduce the desired renormalized mass. 

The second requirement has to do with the algebraic transformation properties 
under the GWL chiral symmetry |Tni d] (for reviews see Refs. [23 1211 )• The GW 
relation ()A.1|) implies that the operator 

75 = 75(l-(2/«)/^o.), (A.4) 

satisfies 7I = 1. In all relevant cases it will further be true that 75 (or its generaliza- 
tion) is hermitian. A possible choice of the GWL chiral transformation is then given 
by (5'?/' = 75'?/', (5?/' = '?/'75. The GW relation can be rewritten as •y^Dov + Dov% = 0, 
which implies that the fermion action 5*0,, = ijjDovip is invariant under the GWL 
transformation (see also Sec. A3 below). 

In the massive case the fermion action cannot be invariant under the GWL trans- 
formation. Instead, in analogy with an ordinary mass term, and assuming that parity 
is a symmetry, one requires that the mass term be a scalar density that transforms 
into a pseudo-scalar density under the GWL transformation. In fact, this require- 
ment can be rather trivially satisfied. Consider a general bilinear fermion action 



Sp = ipDip, assuming only that Sp is hypercubic and parity invariant. Assume also a 
given GW operator Dov (with in general Dov 7^ D). We introduce the standard chiral 
projectors Prl = i(l ± 75) as well as "hatted" chiral projectors Prl = |(1 ± 75), 
and define ipR^L = Pr,l'4^, '4'rl = '4'Pl,r- Note that hatted projectors are used for ip 
while ordinary projectors are used for ip. One can now split the action into two parts, 

Sf = ^{D^ + Dmass)i^ , (A.5) 

where 

D^ = PrDPl + PlDPr, (A.6) 

Dr^ass = PrDPr + PlDPl. (A.7) 

Under the chiral GWL transformation, Z)^ is invariant, whereas Dmass transforms as 
required for a mass term. 

While the decomposition ()A.5|) is possible for any D, clearly this does not imply 
that any D would qualify as a massive overlap operator. In accordance with Eqs. ()A.3|) 
and ()A.5|1 . we require that a massive overlap operator satisfy 

Dov,x{^) = Do^{0) + 0{mpa) , (A.8) 

P>ov,mass{m) = Z m + O {ui^ a , uipa) , (A. 9) 



where Dov^xh^) ^^^ Dov,mass{.'^) are defined by substituting Dovim) into Eqs. ()A.6|) 
and ()A.7|) respectively.^"^ Note that corrections of 0{m^a) are absent in Eq. ()A.8|) 
because the difference between 75 and 75 is 0{pa), and ordinary chiral symmetry (as 
opposed to the GWL type) would forbid mass terms in Dov,xij^)- Like Eq. ()A.3|) . this 
asserts that Dov{fn) satisfies a GW relation in the limit ?7i — > 0; that the difference 
D ov{fTi) — D ovi'^) is 0{m)\ and that to leading order, this difference is actually linear in 
m. What Eqs. ()A.8|) and ()A.9|) add is that Dov^massijn) transforms as expected under 
the GWL symmetry; the above discussion clarifies that this additional requirement 
can always be met for any operator that already satisfies Eq. ()A.3|) . These properties 
ensure that the mass parameter will be renormalized multiplicatively.^^ 

Here, we will add one new requirement. Under a certain scaling assumption to 
be discussed in Sec. A2, we demand that the sequences Dj„^_„(m) and Dov,nijn) both 
have the same n —* 00 limit as the original RG-blocked operators Dtaste,n{f^)i for any 
m. With Dov,niS^) = Dov,n of Eq. (j3.13p . a massive overlap operator that satisfies all 
the above requirements is (the following is identical to Eq. ()3.15p in the main text) 

Dov,n{m) = Dov,n{0) + Dinv,n{m) - Dinv,n{0) , m y^ . (A. 10) 

Of course, we now define the GWL transformation and the hatted projectors using 
75,n = 75(1 — (2/Q;n)-Do,),n(0)). Equatious ()A.8|) and ()A.9|) follow because, similarly 
to Eq. ()A.3|) . one has Dinv,n{jn) = Dinv,n{0) + Zm + 0{rn?a,mpa). Note that the 
proportionality constant Z is necessary in this case, because Din^^nif^) "was defined 

^''The hatted projectors are always defined with respect to l?ot)(0) = Dov In the case of Eq. IJA.21) 
one has Dov,x{m) = Dov and Dov,massim) = m^PuPR + PlPl) = m + 0{mpa). 

"'^^The (finite) ratio of continuum and lattice Z factors (both evaluated at the same scale) will 
generically be a function of ma. 



such that m is the value of the mass in the original one-component staggered operator. 
The n — i> oo convergence properties will be established in the following subsection. 

Last, we briefly comment on the construction of the low-energy effective theories: 
the Symanzik action and the chiral lagrangian. In the case of Eq. ()A.2|) . the GW chiral 
lagrangian has the same internal symmetries as the continuum chiral lagrangian. The 
situation is slightly more involved in the more general case of Eqs. ()A.8|) and ()A.9|) . 
There, terms proportional to powers of ma appear in the chirally invariant part of the 
Dirac operator, Dov,x{'^- This feature will carry over to the chirally invariant part of 
the Symanzik action. In constructing the corresponding chiral theory, one therefore 
has to include a chirally invariant spurion proportional to ma. The spurion would, 
in effect, make the low-energy constants (LECs) of the chiral theory functions of am. 
Such mass dependence in the LECs could present a practical difficulty in extracting 
chiral physics from a simulation that used Dov{m,) as the fundamental Dirac operator. 
However, there is no theoretical problem in considering Dov{m,), and all the standard 
implications of chiral symmetry are preserved. In particular, the masses of Goldstone 
pions vanish in the chiral limit, for any value of the lattice spacing. 

A2. Scaling and convergence for n ^ oo 

A basic hypothesis of the RG treatment of staggered fermions (with or without the 
fourth root) is that the taste-breaking terms of the RG-blocked operator Dtaste,nijn) 
tend to zero in the limit of infinitely many RG blocking steps [T^ ITlj . The taste- 
breaking part A„ is given explicitly by writing 

Aaste,n("^) = A™,n("^) + A„(m) , (A. 11) 

where Dj„^^„(m) is given by Eq. ()3.11|) . We will hold fixed the coarse- lattice spacing 
Oc = a„ obtained after n blocking steps, implying that Oq = 2~"'a„ goes to zero when 
n is taken to infinity. In the free theory ^^, one can prove that ||acA„|| = 0{2~^). 
In the interacting case no proofs can be given; we will assume that A„ scales in the 
same way, up to logarithm,ic corrections in a^ja^ (that we suppress below). We refer 
to Ref. 1^ for a discussion of the status of this assumption, as well as a more precise 
statement about the gauge fields for which it is expected to apply. 

Under this scaling hypothesis it is trivial that Dtaste,n{^ and Dinv^nijn) have a 
common n ^ oo limit, for any m. Furthermore, by Eq. ()A.10|) . the same will be true 
for Dov,n{jn), provided Do?),n(0) has the same n ^* oo limit as -Dj™,n(0). We will now 
prove this. In the rest of this subsection we set m = and drop the mass argument. 
We begin by substituting Eq. ()A.11|) into 

2 

{75 ® 6, Dtaste,n} = Dtaste,n (TS ® ^s) Dtaste,n , (A. 12) 

which is equivalent to Eq. (jH.lOj) . We then multiply both sides of the resulting equa- 
tion by 1 ® ^5, take the trace over taste indices only, and form the tensor product 
with an arbitrary taste matrix H, obtaining 

2 

{Dinv,n , (75 ® 2)} A™,n(75 ® H)Ant.,n = 

1 

tr((l®e5)A„(75®e5)A„)®S. (A.13) 



2a„ 



We used that A„ is traceless on the taste index (compare Eq. ()3.1H) ). By the scahng 
hypothesis, the right-hand side of Eq. ()A.13|) is 0(2^^"), which tells us by how much 
Dinv,n fails to satisfy the GW relation (J3.14J) . Now introducing 

75,n = 75(1 - (2/a„)A™,n) , (A.14) 

it follows from Eq. ()A.13j) that 7|„ = 1 + 0(2"^''). Hence, 75,™ = sign(75_„) = 75_„ + 
0(2-2"). Finally, inserting this into Eq. ^TW we find Dov,n = (an/2)(l - 7575) = 



A3. Index of Dtaste,n 

Here we address the following issue. The one-component staggered theory has 
an exact chiral symmetry for m = 0, the U{l)e symmetry. The corresponding chiral 
transformations of the continuum four-taste theory are generated by 75 (S> ^^5, and 
they form a non-anomalous subgroup of SU{4)i x 5'f/(4)^.^^ In contrast, after any 
number of RG blocking steps, we obtain the operator Dtaste,n{'n^) which, for m = 0, 
only satisfies the GW relation ()3.10p . While the RG-blocked action is invariant under 
the corresponding GWL transformation, this is not enough to establish that it is a 
symmetry. One must further check that the measure term, arising from this change 
of variables, vanishes. Here we show that this is indeed the case. Again we will set 
??i = and drop the mass argument. 

The variation of the measure is given by 



- tr((75 (g) if>)Dtaste,n/an) = mdex{D taste, n) ■ (A.15) 

We note that, loosely speaking, one expects the index of Dtaste,n to vanish in the 
continuum limit, because taste symmetry is recovered in this limit, and tr(,^5) = 0. 
We will establish the stronger result that the index of Dtaste,n is actually zero on the 
lattice. The precise statement is that the index is zero except possibly on a subspace 
Woo C Uq C U, where U is the (finite volume) gauge-field space, Uq is the (proper) 
subspace where Dtaste,n has at least one exact zero mode, and Woo is a proper subspace 
of Uq defined below. Further, Uq is a measure zero subset of W, and Woo is a measure 
zero subset of Wo. 

One can always choose a basis for the exact zero modes of Dtaste,n such that each 
zero mode ipQ has a definite chirality, 

(75^e5)^o = (75 ® ^5)^0 = ±^0 , (A. 16) 

where, analogous to Eq. ()A.4|) . 

(75^e5) = (75 ® e5)(l - (2/a„)Aa.te,n) • (A. 17) 

By Eq. flA.17|) . ordinary and hatted projectors coincide when acting on a zero mode. 
Also, on a zero mode, the Dirac operator Dtaste,n commutes with the chiral generator, 
as usual. Therefore it is enough to show that the index of Dtaste,n is zero with respect 



16 



This is true when the staggered mass term is introduced as Dgtagi'm) — Dstag{0) + m ^. 



to 75 ® ^5 chirality. This can be done by relating the zero modes of Dtaste,n to those 
of Dstag- Iterating Eq. (|3.4j) we have 

Dralte,n = V^n + QnD;,l^Ql , (A. 18) 

where Q„ = Q'^'^^Q^""^) • ■ ■Q^^^Q^^\ If we gradually vary the gauge field so as to 
approach a configuration where Dtaste,n has an exact zero mode, the norm of D^aste,n 
on the left-hand side diverges. This is possible only if the norm of -D^ag diverges 
too. Thus, not surprisingly, any exact zero mode of Dtaste.n must be obtained via RG 
blocking from an exact zero mode of Dstag- 

Because of f/(l)e symmetry, the spectrum of Dstag consists of imaginary pairs ±iA, 
and the corresponding eigenmodes are related by multiplication with e(x). Since the 
eigenvalues are continuous functions of the gauge fields, and since there are no zero 
modes in the free case, any zero modes that appear must also be paired. We choose a 
chiral basis for the two zero modes, which is always possible. Then, as the gauge field 
changes, the off-diagonal matrix element of Dstag between the modes is not forbidden 
by t/(l)e symmetry, and is thus generically non-zero. This suggests — in accordance 
with standard lore — that exact zero modes exist only on a zero measure subspace 

Wo. 

Using Eq. ()3.7|) it follows from the above discussion that, given a pair of zero 
modes of Dstag, then Dtaste,o must have a corresponding pair of zero modes, with one 
zero mode of each 75 ® ,^5 chirality. The index of both Dstag and Dtaste,o is, thus, 
always zero. The index of Dtaste,n could only be non-zero if the blocking transforma- 
tion Q^^^Q^""^) ■ ■ ■ Q(^) exactly annihilated one of the definite-chirality zero modes of 
Dtastefi but not the other. Generically this will not happen, and the subspace Uqq 
where this does happen therefore has measure zero with respect to Uq. (We leave it 
open whether or not Woo is an empty set.) Assuming that no (interesting) QCD ob- 
servable has a (5-function support on Wqo, the GWL transformation is then a symmetry 
of the RG blocked theory. 
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